Advantages of wideband over narrowband channels for speaker verification employing MFCCs and LFCCs
Authors
Abstract
Wideband communications permit the transmission of an extended frequency range compared to traditional narrowband. While benefits for automatic speaker recognition can be expected, the extent of the contribution of the additional bandwidth in wideband is still unclear. This work compares i-vector speaker verification performance employing speech signals of 0-4 kHz, 4-8 kHz, and 0-8 kHz and different sets of cepstral features extracted using linearly and mel-spaced filterbanks. Analyses of clean speech and of speech transmitted through commonly employed codecs are conducted separately for male and for female speech. Our evaluation on two different datasets shows improved speaker verification performance with the extended bandwidth, and also that the linear scale can lead to better results for narrowband signals. The advantages of linear over mel-scaled features for wideband depend on the speakers' gender and on the channel distortion.
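The difference between the linearly and mel-spaced filterbanks mentioned in the abstract can be illustrated with a minimal sketch that places triangular filter centres in each of the three bands. It is not the authors' feature extraction: the mel formula used here (2595 log10(1 + f/700)) is one common convention and is an assumption, as are the helper names `hz_to_mel`, `mel_to_hz`, `filter_centers`, and the choice of 20 filters.

```python
import numpy as np

def hz_to_mel(f_hz):
    # One common mel-scale convention (assumed, not taken from the paper).
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)

def mel_to_hz(m):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def filter_centers(f_lo, f_hi, n_filters, scale="mel"):
    """Centre frequencies (Hz) of n_filters triangular filters covering [f_lo, f_hi].

    Edges are equally spaced on the chosen scale; the two outer edges
    are the band limits, so n_filters + 2 edge points are generated.
    """
    if scale == "mel":
        edges = mel_to_hz(np.linspace(hz_to_mel(f_lo), hz_to_mel(f_hi), n_filters + 2))
    elif scale == "linear":
        edges = np.linspace(float(f_lo), float(f_hi), n_filters + 2)
    else:
        raise ValueError("scale must be 'mel' or 'linear'")
    return edges[1:-1]

# The three bands compared in the abstract (limits in Hz).
BANDS = {"0-4 kHz": (0, 4000), "4-8 kHz": (4000, 8000), "0-8 kHz": (0, 8000)}

if __name__ == "__main__":
    n_filters = 20  # hypothetical filterbank size, for illustration only
    for name, (lo, hi) in BANDS.items():
        for scale in ("linear", "mel"):
            centers = filter_centers(lo, hi, n_filters, scale)
            below_4k = int(np.sum(centers < 4000))
            print(f"{name:8s} {scale:6s} filters centred below 4 kHz: {below_4k}/{n_filters}")
```

Under these assumptions, the mel scale concentrates more filters in the lower part of the 0-8 kHz band than the linear scale does, which gives a sense of why the two feature types could behave differently once the 4-8 kHz region is available.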
Similar resources
I-vector Speaker Verification for Speech Degraded by Narrowband and Wideband Channels
Voice biometrics are frequently exposed to channel degradations of transmitted speech and to channel mismatch between enrolment and test utterances, which cause speaker recognition systems to perform poorly. In this paper, the influence of channel bandwidth and speech coding on speaker verification is assessed employing the state-of-the-art i-vector technique. Our focus is on the possible benef...
Spectral Sub-band Analysis of Speaker Verification Employing Narrowband and Wideband Speech
It is well known that the speaker discriminative information is not equally distributed over the spectral domain. However, it is still not clear whether that distribution is altered when the speech is transmitted through telecommunication channels, which introduce different kinds of degradations. In this paper we address the analysis of different frequency sub-bands when the speech is distorted...
Analysis of Automatic Speaker Verification Performance over Different Narrowband and Wideband Telephone Channels
Current speaker recognition applications involve the authentication of users by their voices for access to restricted information and privileges. The speech signal is often transmitted to the recognizer through communication channels presenting different transmission characteristics. The aim of this paper is to study the effects of speech bandwidth and coding schemes on speaker verification. We...
I-vector speaker verification based on phonetic information under transmission channel effects
Past studies have shown evidence of important speaker-specific content in the higher frequencies of the spectrum, which are filtered out by narrowband channels. Besides, wideband transmissions, which are gaining ground over narrowband communications, offer an extended range of frequencies which account not only for better speech quality and intelligibility, but also for an improved speaker recog...
Mel, linear, and antimel frequency cepstral coefficients in broad phonetic regions for telephone speaker recognition
We've examined the speaker discriminative power of mel-, antimel- and linear-frequency cepstral coefficients (MFCCs, aMFCCs and LFCCs) in the nasal, vowel, and non-nasal consonant speech regions. Our inspiration came from the work of Lu and Dang in 2007, who showed that filterbank energies at some frequencies mainly outside the telephone bandwidth possess more speaker discriminative power due to ...